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(54) File systems with versatile indirection 

(57) A memory storage device has a file storage 
operating system that uses inodes to access file seg- 
ments. Each inode has a plurality of rows. A portion of 
the rows can store extents pointing directly to data 
blocks. Each extent either has a field to indicate whether 
the extent is an indirect extent or a direct extent or has a 
field to indicate an address for a target a logical volume. 
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Description 

Background of the invention 

[0001] This application relates generally to file sys- 
tems, and more particularly, to extent based file sys- 
tems. 

[0002] Computer systems manipulate and store 
data files that often include a sequence of file segments. 
Each file segment occupies a consecutive sequence of 
physical storage blocks. The different file segments 
may, however, be stored at widely separated physical 
storage locations. 

[0003] A file system makes the details of data stor- 
age of data files simpler for software application pro- 
grams. The file system enables high-level applications 
to address stored data through abstract concepts such 
as directory name, file name, and offset rather than 
through actual physical storage addresses. This system 
for addressing data storage makes software applica- 
tions less dependent on how data is physically stored so 
that the applications are less tied to the physical storage 
system and more portable, 

[0004] FIG. 1 illustrates a file system that UNIX 
based systems employ to translate between abstract file 
names and physical storage addresses. The file system 
perfomris translations with the aid of two types of struc- 
tures, which are stored on a data storage device 1 0. The 
first type of structure is a directory 12, which maps 
abstract directory names and file names to other direc- 
tories 13 and index nodes (inodes) 15, 16, respectively. 
The second type of structure is the mode 15, 16, which 
maps abstract file segments to the physical data blocks 
1 7-1 9 storing the segments. 

[0005] The modes 15, 1 6 include lists of extents 21- 
27. By definition, the consecutive extents 21-24 of each 
mode 1 5 correspond to consecutive file segments and 
indicate the storage addresses of the segments by an 
address pointer and a length. The address pointer indi- 
cates the physical address of the first data block, for 
example, blocks 55, 58, 5 storing the file segment. The 
length indicates the number of consecutive data blocks 
assigned to store the segment. For example, the extent 
21 , which points to the address of the data block 55 and 
has length three, includes the three data blocks 55-57. 
[0006] Each inode 15, 16 can also include one or 
more special extents 24, 27 stored at special positions 
of the mode 15,16, that is, the last rows allocated in the 
modes for extents. The special extents 24, 27 point to 
data blocks that store additional extents. For example, 
the special extent 24 points to the data block 97 that 
stores additional extents 18-20, 28. The additional 
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segments, and enable extending the end of the file to 
increase the associated file's size. The last extent 28 of 
the data block 95 can also be a special extent thereby 
providing for further extensions of the end of the file. 
[0007] Some file systems translate between large 



files and physical storage. FIG. 2 illustrates a file system 
30 capable of translating an abstract file 31 to data 
blocks stored on multiple physical disks 32, 33. To pro- 
vide enough storage space for the large file 31, the file 

5 system 34 interacts with an intermediate abstraction 
layer, a virtual logical volume 35, which translates phys- 
ical space 36-37 in the separate physical disks 32-33 
into a single virtual space 38. Then, software applica- 
tion 39, which accesses the file 31 , sees the single large 

JO virtual volume 35 and is unaware of the separate 
devices 32-33. 

[0008] One objective of the present invention is to 
provide a file system that gives a more flexible method 
for extending an existing file. 
15 [0009] Another objective of the present invention is 
to provide a file system adapted to storing large files. 

Summary of the Invention 

20 [0010] In a first aspect, the invention provides a 
memory storage device, which uses at least one mode 
for accessing file segments in storage devices. Each 
mode has a plurality of rows. A portion of the rows store 
extents pointing to data blocks. Each extent has a field 

25 to indicate whether the extent is an indirect extent or a 
direct extent. 

[001 1] In a second aspect, the invention provides a 
method for storing data files, which is performed by an 
operating system stored in a memory device. The 

30 method includes steps for writing extents to an inode 
assigned to the file, writing data to first and second data 
blocks, inserting an indirect extent in the mode between 
first and second ones of the extents, and writing a third 
extent to a third data block. The first and second ones of 

35 the extents point to the first and second data blocks. 
The indirect extent points to the third data block. The 
third extent points to a data block storing a segment of 
the file. 

[0012] In a third aspect, the invention provides a 
40 distributed storage system. The storage system 
includes a global cache memory, a plurality of proces- 
sors coupled to the global cache memory, and a plural- 
ity of data storage devices coupled to the global cache 
memory. Each processor has a local memory for storing 
45 an operating system. The devices and processors are 
capable of communicating by posting messages to each 
other in the cache memory. Each of the devices has a 
processor and local memory storing an operating sys- 
tem. Each operating system includes an extent based 
50 file system for abstracting file names to physical data 
blocks in the storage devices. 

[0013] In a fourth aspect, the invention provides a 
memory storage device having an operating system,, 
which uses one inode to record and find segments of a 
55 data file. Each mode has a plurality of rows. Part of the 
rows are written with direct extents, which point to data 
blocks storing portions of file segments. At least two of 
the extents point to data blocks whose addresses are in 
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different logical volumes. 

[0014] In a fifth aspect, the invention provides a 
memory storage device, which uses at least one mode 
for accessing file segments in storage devices. Each 
mode has a plurality of rows. A portion of the rows store 5 
extents pointing to data blocks. Each extent has a field 
to indicate whether the extent is an indirect extent or a 
direct extent and a field for a logical volume pointer. 
[0015] In a sixth aspect, the invention provides a 
method for storing data files, which is performed by an 
operating system stored in a memory device. The 
method includes steps for writing extents to an mode 
assigned to the file, writing data to first and second data 
blocks, inserting an indirect extent in the inode between 
first and second ones of the extents, and writing a third 
extent to a third data block. The first and second ones of 
the extents point to the first and second data blocks. 
The indirect extent points to the third data block. The 
third extent points to a data block storing a segment of 
the file. 

[0016] In a seventh aspect, the invention provides a 
distributed storage system. The storage system 
includes a global cache memory, a plurality of proces- 
sors coupled to the global cache memory, and a plural- 
ity of data storage devices coupled to the global cache 
memory Each processor has a local memory for storing 
an operating system. The devices and processors are 
capable of communicating by posting messages to each 
other in the cache memory. Each of the devices has a 
processor and local memory storing an operating sys- 
tem. Each operating system includes an extent based 
file system for abstracting file names to physical data 
blocks in the storage devices by assigning an inode to 
each file. Each inode adapted to store extents having a 
field to pointing to a logical volume. 

Brief Description of the Drawings 

[0017] Other objectives, features, and advantages 
of the invention will be apparent from the following 
description taken together with the drawings, in which: 

FIG. 1 illustrates physical structures used by a prior 
art file system to abstract data storage; 
FIG. 2 schematically illustrates a prior art method 
for abstracting large files; 

FIG. 3 illustrates a distributed storage system hav- 
ing a global cache memory; 
FIG. 4 illustrates how the file system of the distrib- 
uted storage system of FIG. 3 translates large files 
to physical storage volumes; 
FIG. 5 illustrates physical structures used by the file 
system of FIGs. 3 and 4; 

FIGs. 6A and 6B illustrate the format of the extents 
in the inode of FIG. 5; 

FIG. 7 illustrates the use of direct, hole, and indirect 
extents by the file system of FIGs. 3, 5, 6A and 68; 
FIGs. 8A and 88 illustrate how indirect extents ena- 



ble expansions of a file at middle points; 
FIG. 8C is a flow chart illustrating a method of 
expanding a file with indirect extents; 
FIG. 9 is a schematic illustration of nesting of indi- 
rect extents; 

FIG. 10 is a flow chart illustrating a method of stor- 
ing a file in multiple logical volumes; and 
FIG. 1 1 illustrates one embodiment of the header of 
an inode. 

Description of the Preferred Embodiments 

[0018] FIG. 3 illustrates a distributed storage sys- 
tem 40 in which a global cache memory 42 couples to a 
plurality of processors 44, 45 and storage device drivers 
47-49. Each processor 44, 45 and driver 47-49 has a 
central processing unit (CPU) to control input/output 
(1/0) with the global cache memory 42 and permanent 
memory, for example, ROM or EPROM, storing microc- 
ode. The microcode includes an operating system (OS) 
with a file system. The various device drivers 47-49 may 
have multiple storage disks arranged, for example, in 
RAID configurations. 

[0019] The global cache memory 42 provides a 
symmetric environment for communications between 
the processors 44, 45 and the drivers 47-49. The proc- 
essors 44, 45 and drivers 47-49 send requests to and 
respond to requests from the other processors 44, 45 
and drivers 47-49 by writing messages in predeter- 
mined locations of the global cache memory 42. The 
messages identify the intended recipients by physical 
addresses. The recipients recognize and read the mes- 
sages posted in the global cache memory 42. Thus, the 
global cache memory 42 acts like a blackboard on 
which the processors 44, 45 and drivers 47-49, which 
recognize each other as separate logical devices, write 
messages to each other. 

[0020] FIG. 4 illustrates how the file system 50 on 
each processor 44, 45 and driver 47-49 of FIG. 3 can 
map file segments of one file to different logical devices 
and volumes. For example, the file system 50 maps dif- 
ferent segments of the file 51 to different ones of the 
drivers 47-48. The file system 50 translates the abstract 
file name and offset for the file 51 directly to physical 
segments 52, 53 stored on the different drivers 47-48 
without creating a virtual volume, unlike the file system 
34 shown in FIG. 2. Since the file system identifies the 
driver 47-48 storing each segment 52, 53, the proces- 
sors 44-45 and drivers 47-49 address those drivers 47- 
48 directly to manipulate the segments of the file 51. 
[0021] FIG. 5 illustrates physical structures that the 
file system of FIG. 3 uses to translate between abstract 
files and physical data blocks. The physical structures 
include directories 61, 62 and modes 63, 64. Each 
directory 61 translates abstract file names and directory 
names to physical addresses of inodes 63, 64 and 
directories 62, respectively. Each mode 63, 64 stores a 
list of extents 65-66, which map consecutive file seg- 
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ments to strings of physical data blocks 80-82, 83-84, 
92-94. 

[0022] The physical directories 61, 62 and inodes 
63, 64 are stored in the global cache memory 42. Cop- 
ies of the relevant directories 61, 62 and/or inodes 63, 
64 may also be stored locally to volatile memory of the 
processors 44, 45 and drivers 47-49. The locally stored 
copies speed up I/O by the various local operating sys- 
tems. 

[0023] Each data bloclc 80-84, 92-94 has the same 
size, for example, 4K bytes. Nevertheless, the extents 
65-66 can map file segments of different sizes to physi- 
cal storage locations. To handle file segments of differ- 
ent sizes, each extent has a length field that indicates 
the number of data bloclcs in the string of data blocks 
that stores the associated file segment. 
[0024] The various extents 65, 66 of each mode 63, 
64 may map to data blocks 80-84, 92-94 of different log- 
ical volumes LV1 , LV2. For example, the extents 1 and 2 
of the mode 63 map to the data blocks 80-83 in a first 
logical volume LV1 , and the extent 3 of the same mode 
63 maps to data blocks 92-93 in a second logical vol- 
ume LV2. The different extents 65, 66 can map different 
segments of a single abstract file to different ones of the 
drivers 47-49 and to different physical disks and parti- 
tions therein. 

[0025] FIGs. 6A and 6B illustrate the format of the 
extents in the modes 63, 64 of FIG. 5. Each extent of the 
illustrated embodiment has three fields including an 
address pointer field, a length field, and a flag field. 
[0026] The address pointer field indicates both a 
logical volume and a physical offset of a data block in 
the logical volume. In one embodiment, the pointer 
fields for the logical volume and the data block therein 
are 2 bytes and 4 bytes long, respectively. For this field 
size and data blocks of 32 kilobytes, the extent fields 
can identify about 140x10^^ bytes of data in each of 
about 64K different logical volumes. Thus, the file sys- 
tem of the distributed storage system 40 can handle 
very large files. 

[0027] The length field indicates the number of con- 
secutive data blocks in the string assigned to a file seg- 
ment. In the above-described embodiment, the length 
field is 4 bytes long and thus, distinguishes a wide range 
of string lengths. If the values of the length field equal 
the number of data blocks in the associated string, 
strings can include from one data block to about 4x1 0® 
data blocks. 

[0028] In the above-described embodiment, the flag 
field uses two bytes to characterize types of data blocks 
pointed to by an extent. A first portion of the flag field 
indicates whether the data blocks are locked or 
unlocked, that is, available or unavailable. The locked 
designation indicates that access to the data blocks is 
limited. The processors 44-45 and drivers 47-49 may 
change the flag field of an extent to the locked designa- 
tion while manipulating data in the associated data 
blocks so that other devices do not access the data 



blocks in parallel. A second portion of the flag field indi- 
cates whether empty data blocks have been zeroed. By 
using the not zeroed designation, the file system can 
allocate a data block to a file without zeroing the block 

5 beforehand. If a subsequent access writes the entire 
data block, the block will not have to be zeroed saving 
processing time. A third portion of the flag field catego- 
rizes the data type stored in a data block into one of 
three types, that is, real file data, non-data, or extents. 

10 [0029] FIG. 7 illustrates the relationship between 
the third portion of the flag field and the data type of the 
data blocks pointed to by an extent. If data blocks 1 00 
have real data for the associated file, the third portion of 
the flag field indicates that the associated extent 1 01 is 

75 a direct extent. If the data blocks 102 are not yet allo- 
cated, the third portion of the flag field indicates that the 
associated extent 1 03 is a hole extent. The hole extent 
is useful for reserving a range of offsets of a file without 
consuming disk space to back up the offsets. Finally, if 

20 the data blocks, for example data block 1 05, store more 
extents, the third portion of the flag field indicates that 
the extent, here extent 1 06, is an indirect extent 
[0030] FIGs. 8A and 88 show how the operating 
system uses indirect extents to grow the middle of a file. 

25 FIG. 8A shows an inode 110 assigned to the file. The 
mode 1 1 0 has consecutive direct extents 111, 113, 119 
that point to data blocks 1 14, 215, 330 storing originally 
consecutive segments of the file. FIG. 88 shows the 
final file in which an indirect extent 112 has been 

30 inserted between the two original direct extents 111, 
119. The indirect extent 112 points to more extents 
stored in a data block 116. These extents, in turn, point 
to new data block 1 1 7 and original data block 21 5. Since 
the indirect extent 1 12 is physically located between the 

35 two original extents 111, 119, the segments stored in 
the blocks 117, 215 (indirectly pointed to) are logically 
located between the original segments stored in the 
blocks 114, 330. Inserting the indirect extent 112 has 
grown the middle of the associated file by logically 

40 inserting the segment in new data block 117 between 
the originally consecutive segments in data blocks 114 
and 215. 

[0031] The file system, illustrated in FIGs. 5-88, 
allows any extent of an mode to be indirect, because the 

45 flag field indicates the type of each extent. This free 
placement of indirect extents within the modes enables 
an operating system to logically insert a new data seg- 
ment between any two selected data segments of a file 
without physically moving data blocks. To insert a new 

50 data segment, the system inserts an indirect extent into 
the file's inode between the two extents for the selected 
data segments. Then, the system makes the indirect 
extent point to a data block storing new direct extents 
that point, in turn, to the consecutive pieces of new data 

55 segment. The new direct extents are logically located in 
the mode at the point where the new indirect extent has 
been inserted. 

[0032] Since the insertion of the new segments 
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does not Involve moving previously stored file seg- 
ments, file expansions can be less time intensive and 
more convenient with the present file system than in 
prior art file systems. Prior art file systems that 
expanded files either by moving data blocks of file data 5 
or by appending file data to the end of the file often 
required substantial time to move previously stored 
data. 

[0033] FIG. 8C is a flow chart illustrating a method 
130 of inserting a new file segment between two adja- 10 
cent file segments. To insert the new segment, the oper- 
ating system first determines whether at least one 
empty row remains for writing a new extent to the file's 
mode, for example to mode 110 of FIG. 8A (step 132). 
In FIG. 8A, the operating system would determine that is 
the inode 1 1 0 does not have an empty row. 
[0034] If the mode has an empty row, the operating 
system shifts down the original extents corresponding 
to segments that will follow the segments to be inserted 
by one row in the mode (step 134). Then the operating 20 
system inserts a new direct extent in the newly emptied 
row of the mode (step 136). Rnally the operating sys- 
tem writes the new file segment to a new data block 
pointed to by the new direct extent (step 138). 
[0035] On the other hand, if the mode does not 25 
have an empty row, e.g., the case of FIG. 8A, the oper- 
ating system selects a new, available, data block to use 
as an indirect block (step 140). In FIG. 8A, the new indi- 
rect block is the block 116. Then, the operating system 
writes the extent following the point of insertion to the 30 
second row of the new indirect block (step 142). In FIG. 
8B, the operating system writes the extent 113 to the 
second row of the data block 116. Next, the operating 
system writes a new direct extent in the first row of the 
indirect block (step 144). In FIG. 8B, the operating sys- 35 
tern writes the new extent to the first row for extents in 
the indirect block 116. 

[0036] Next, the operating system inserts an indi- 
rect extent into the row of the inode previously occupied 
by the extent now in the second row of the indirect block 40 
(step 146). The new indirect extent points to the new 
indirect block and has a length equal to the sum of the 
lengths of both extents in the indirect block. In FIG. SB, 
the operating system writes the extent 112 pointing to 
the data block 1 1 6 to the mode 1 1 0. Rnally the operat- 45 
ing system writes the new file segment in the new data 
block pointed to by the new direct extent (step 148). In 
FIG. 88, the new file segment is written to the data block 
117. 

[0037] FIG. 9 illustrates an example where the file so 
system nests indirect extents. In the example, the mode 
110 includes indirect extent 120, which points to data 
block 121. In turn, block 121 includes indirect extent 
122, which points to data block 123, and block 123 
includes indirect extent 1 24, which points to block 1 25. ss 
[0038] Nesting indirect extents enables growing a 
file between any two original file segments without size 
limits. Nesting also introduces extra costs during 
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accesses. Each access to a file segment pointed to by 
nested indirect extents costs extra look ups and addi- 
tional look up time. 

[0039] FIG. 10 is a flow chart illustrating a method 
150 of allocating data blocks to a file from a plurality of 
logical volumes, for example, the volumes LV1, LV2 
shown in FIG. 5. The operating system assigns an 
mode to the file by writing the mode address and the file 
name to a row in a directory (step 152). In FIG. 5, the 
operating system wrote the mode address for the mode 
63, in entry of the root directory 61 for file name A. The 
operating system selects a logical volume with a larger 
than average contiguous region of available data blocks 
(step 154). The operating system determines the maxi- 
mum number of available contiguous blocks in each log- 
ical volume from data in the volume's header or from 
Information in a superblock spanning the entire storage 
system. The operating system allocates a string of data 
blocks from the contiguous region of the selected vol- 
ume to the file by writing an extent, which points to the 
string, in the first row of the mode assigned to the file 
(step 156). The extent Indicates both the logical volume 
and an offset of the first data block of the string of blocks 
within the selected logical volume. 
[0040] Later, a request from a software application 
for more data blocks for the file is received by the oper- 
ating system (step 157). In response to the request, the 
operating system determines whether the region contig- 
uous to the physical location of the previous segment of 
the file has more available data blocks (step 158). If 
region has more available blocks, the operating system 
allocates a new string of blocks immediately following 
the physical location previous segment, i.e., contiguous 
with the previous segment (step 1 60). Then, the operat- 
ing system Increases the value of the length stored in 
the length field of the previous extent for the region by 
the number of blocks in the new string (step 161). If no 
blocks contiguous to the previous segment are availa- 
ble, the operating system again searches for a logical 
volume with a larger than average contiguous region of 
available data blocks (step 162). The newly found logi- 
cal volume may be a different logical volume. Thus, the 
new string of data blocks may be allocated to the file 
from a different logical volume. 
[0041] In some embodiments, the allocation of 
more modes is dynamic and stimulated by potential 
need. This dynamic allocation results in less waste of 
storage space by unused modes. Dynamic allocation 
also Implies that the physical addresses of the modes 
are not predetermined. Since the physical addresses 
are not predetermined, separate structures record the 
address of each inode for use by the operating system 
in the event of a system failure. 
[0042] FIG. 11 illustrates the headers 170 of one 
embodiment of the Inodes 63, 64 of FIG. 5. The headers 
170 provide the separate structures used to record the 
addresses of each inode. Each header 170 has entries 
1 72, 1 74 for the addresses of the next inode to be allo- 
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cated and of the previously allocated inode, respec- 
tively. These entries are written to the header 1 70 when 
the associated inode is first allocated. 
[0043] By performing a serial chain search on the 
entries 172, 174 of the headers 170 of each inode, an 
operating system can find the addresses of each inode 
by using a predetermined address for the first allocated 
inode. The serial chain search finds modes sequentially 
by hopping from found inode to found inode. After a sys- 
tem failure, the serial chain search enables a sequential 
reconstruction of the control data structures of the file 
system. 

[0044] FIG. 1 1 also illustrates a entry 1 76 of the 
inode header 170. The entry 176 is binary valued and 
indicates whether the inode stores a data file or a list of 
extents. One binary value of the entry 1 76 indicates that 
the inode stores a list of extents for the associated data 
file, and the other binary value indicates that the inode 
stores the data file itself. Thus, each inode can either 
store a list of extents or a small data file. 
[0045] The operating system writes the binary value 
to the third entry 176 to indicate storage of a data file 
when the associated inode is first created. Then, the 
operating system uses the inode to store the associated 
data file. When the size of the data file surpasses the 
limited space available in the inode, the operating sys- 
tem converts the inode to an inode for storage of lists of 
extents, 

[0046] To perform the conversion, the operating 
system moves any data already stored in the inode to 
data blocks, writes extents in the inode to point to the 
data blocks, and changes the entry 176 to indicate 
extent storage. To store more data of the associated 
data file, the operating system writes more extents 
sequentially to the mode and stores the new data seg- 
ments in the data blocks to which the new extents point. 
[0047] Storing small data files in an inode directly 
reduces access times for data. Data retrieval from such 
files does not require a separate search for an mode 
and a data block. Thus, employing unused modes to 
store small data files reduces the amount of time 
needed for look ups. The cost of constructing an inode 
that can store either a list of extents or a data file is 
small. The cost is one more entry in the inode's header 
170. 

[0048] Other additions, subtractions, and modifica- 
tions of the described embodiments may be apparent to 
one of ordinary skill in the art. 

Claims 

1 , A memory storage device having an operating sys- 
tem which uses at least one inode for accessing file 
segments, the inode comprising: 

a plurality of rows; and 

a portion of the rows storing extents pointing to 
data blocks, each extent having a field to indi- 



cate whether the extent is an indirect extent or 
a direct extent. 

2. An automated method of storing data files in a 
5 memory storage system, comprising: 

assigning an inode to a data file to be stored; 
and 

writing a plurality of extents in the inode, each 
10 extent pointing to a string of one or more data 

blocks for storing a segment of the data file and 
having a field for indicating that the extent is 
one of an indirect extent and a direct extent. 

15 3. A method of storing data files, comprising: 



writing a plurality of extents to an inode; 
writing first and second data segments of the 
file to first and second physical data blocks, first 
and second ones of the extents pointing to the 
first and second physical data blocks; 
shifting at least one of the extents in the inode 
to leave a space between the first and second 
ones of the extents in response to the inode 
having at least one empty row; and 
inserting a third direct extent in the space in 
response to the inode having an empty row 
between the first and second extents, the direct 
extent pointing to a third data block to store a 
new data segment of the file. 
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4. In a memory storage device employing an operat- 
ing system comprising instructions for a method for 
storing data files, the method comprising: 

writing a plurality of extents to an inode 
assigned to a file; 

writing data to first and second data blocks, first 
and second ones of the extents pointing to the 
first and second physical data blocks; 
inserting an indirect extent in the inode 
between the first and second ones of the 
extents, the indirect extent pointing to a third 
physical data block; and 
writing at least one extent to the third physical 
data block, the one extent pointing to a physical 
data block storing a segment of the file. 

5. The method of claim 4, further comprising: 

writing a length field to each extent, the length 
field fixing the number of consecutive data 
blocks pointed to by the extent. 



55 6. A distributed storage system, comprising: 

a global cache memory; 

a plurality of processors coupled to the global 
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cache memory, each processor having a local 
memory for storing an operating system; and 
a plurality of data storage devices coupled to 
the global cache memory, the devices and 
processors capable of communicating by post- 
ing messages to each other in the cache mem- 
ory, each of the devices having a processor 
and local memory storing an operating system, 
each operating system including an extent 
based file system for abstracting file names to 
physical data blocks in the storage devices. 

A memory storage device having an operating sys- 
tem which uses at least one inode to record and 
find segments of a data file, the inode comprising: 

a plurality of rows, and 

a portion of the rows being written with direct 
extents pointing to data blocks storing portions 
of file segments, at least two of the extents 
pointing to data blocks having addresses in dif- 
ferent logical volumes. 

A memory storage device having an operating sys- 
tem which uses at least one inode for accessing file 
segments, the inode comprising: 

a plurality of rows; and 

a portion of the rows storing extents pointing to 
data blocks, each extent having a field to indi- 
cate whether the extent is an indirect extent or 
a direct extent, each direct extent of the inode 
further comprises a field for a logical volume 
pointer. 



An automated method of storing data files 
memory storage system, comprising: 



in a 



assigning an inode to a data file; and 
writing a plurality of extents in the inode, each 
extent pointing to a sequence of one or more 
data blocks for storing a segment of the data 
file, at least two of the extents pointing to differ- 
ent logical volumes. 

10. An automated method of storing data files in a 
memory storage system, comprising: 

assigning an inode to a data file to be stored; 
and 

writing a plurality of extents in the inode, each 
extent pointing to a string of one or more data 
blocks for storing s segment of the data file and 
having a field for indicating that the extent is 
one of an indirect extent and a direct extent, the 
act of writing includes writing an address 
pointer in each extent, the address pointer indi- 
cating a logical volume and an offset of a data 
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block in the logical volume. 

11. In a memory storage device employing an operat- 
ing system comprising instructions for a method for 
storing data files, the method comprising: 

writing a plurality of extents to an mode 
assigned to a file, at least two of the extents 
pointing to different logical devices; 
writing data to first and second data blocks, first 
and second ones of the extents pointing to the 
first and second physical data blocks; 
inserting an indirect extent in the inode 
between the first and second ones of the 
extents, the indirect extent pointing to a third 
physical data block; and 
writing at least one extent to the third physical 
data block, the one extent pointing to a physical 
data block storing a segment of the file. 

12. The method of claim 11, further comprising: 

writing a start address field to each extent, the 
start address field including a pointer to a logi- 
cal volume portion and a pointer to a data block 
in the logical volume; and 
writing a length field to each extent, the length 
field fixing the number of consecutive data 
blocks pointed to by the extent. 

13. A distributed storage system, comprising: 

a global cache memory; 
a plurality of processors coupled to the global 
cache memory, each processor having a local 
memory for storing an operating system; and 
a plurality of data storage devices coupled to 
the global cache memory, the devices and 
processors capable of communicating by post- 
ing messages to each other in the cache mem- 
ory, each of the devices including a processor 
and local memory storing an operating system, 
each operating system having an extent based 
file system for abstracting file names to physi- 
cal data blocks in the devices by assigning an 
inode to each file, each inode adapted to store 
extents having a field to point to a logical vol- 
ume. 
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